Assembly of polymorphic genomes: algorithms and application to Ciona savignyi.

Authors

  • Jade P Vinson
  • David B Jaffe
  • Keith O'Neill
  • Elinor K Karlsson
  • Nicole Stange-Thomann
  • Scott Anderson
  • Jill P Mesirov
  • Nori Satoh
  • Yutaka Satou
  • Chad Nusbaum
  • Bruce Birren
  • James E Galagan
  • Eric S Lander
Abstract

Whole-genome assembly is now used routinely to obtain high-quality draft sequence for the genomes of species with low levels of polymorphism. However, genome assembly remains extremely challenging for highly polymorphic species. The difficulty arises because two divergent haplotypes are sequenced together, making it difficult to distinguish alleles at the same locus from paralogs at different loci. We present here a method for assembling highly polymorphic diploid genomes that involves assembling the two haplotypes separately and then merging them to obtain a reference sequence. Our method was developed to assemble the genome of the sea squirt Ciona savignyi, which was sequenced to a depth of 12.7x from a single wild individual. By comparing finished clones of the two haplotypes, we determined that the sequenced individual had an extremely high heterozygosity rate, averaging 4.6% with significant regional variation and rearrangements at all physical scales. Applied to these data, our method produced a reference assembly covering 157 Mb, with N50 contig and scaffold sizes of 47 kb and 989 kb, respectively. Alignment of ESTs indicates that 88% of loci are present at least once and 81% exactly once in the reference assembly. Our method represented loci in a single copy more reliably and achieved greater contiguity than a conventional whole-genome assembly method.
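
The N50 contig and scaffold sizes quoted in the abstract are a standard contiguity measure: the length L such that sequences of length L or longer contain at least half of the assembled bases. The sketch below is only an illustration of that common definition, not the authors' code; the function name `n50` and the toy lengths are assumptions made for the example and are not taken from the paper.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Uses the common definition: the largest length L such that sequences
    of length >= L together hold at least half of the total bases.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)  # longest sequences first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy lengths (hypothetical, not data from the paper).
toy_contigs = [80, 70, 50, 40, 30, 20]
print(n50(toy_contigs))
```

Some tools compute the threshold against an estimated genome size rather than the assembled total (often called NG50), so values reported by different pipelines are not always directly comparable.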

Related articles

The C. savignyi genetic map and its integration with the reference sequence facilitates insights into chordate genome evolution.

The urochordate Ciona savignyi is an emerging model organism for the study of chordate evolution, development, and gene regulation. The extreme level of polymorphism in its population has inspired novel approaches in genome assembly, which we here continue to develop. Specifically, we present the reconstruction of all of C. savignyi's chromosomes via the development of a comprehensive genetic m...

Inverse Correlation of Population Similarity and Introduction Date for Invasive Ascidians

The genomes of many marine invertebrates, including the purple sea urchin and the solitary ascidians Ciona intestinalis and Ciona savignyi, show exceptionally high levels of heterozygosity, implying that these populations are highly polymorphic. Analysis of the C. savignyi genome found little evidence to support an elevated mutation rate, but rather points to a large population size contributin...

Exploiting the extraordinary genetic polymorphism of ciona for developmental genetics with whole genome sequencing.

Studies in tunicates such as Ciona have revealed new insights into the evolutionary origins of chordate development. Ciona populations are characterized by high levels of natural genetic variation, between 1 and 5%. This variation has provided abundant material for forward genetic studies. In the current study, we make use of deep sequencing and homozygosity mapping to map spontaneous mutations...

Non-coding RNAs in Ciona intestinalis

MOTIVATION The analysis of animal genomes showed that only a minute part of their DNA codes for proteins. Recent experimental results agree, however, that a large fraction of these genomes are transcribed and hence are probably functional at the RNA level. A computational survey of vertebrate genomes has predicted thousands of previously unknown ncRNAs with evolutionarily conserved secondary st...

Chaining Algorithms for Alignment of Draft Sequence

In this paper we propose a chaining method that can align a draft genomic sequence against a finished genome. We introduce the use of an overlap tree to enhance the state information available to the chaining procedure in the context of sparse dynamic programming, and demonstrate that the resulting procedure more accurately penalizes the various biological rearrangements. The algorithm is teste...

Journal:
  • Genome Research

Volume 15, Issue 8

Pages: -

Publication date: 2005